Optimality of qubit purification protocols in the presence of imperfections 
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Quantum control is an essential tool for the operation of quantum technologies such as quantum 
computers, simulators, and sensors. Although there are sophisticated theoretical tools for developing 
quantum control protocols, formulating optimal protocols while incorporating experimental condi- 
tions remains a challenge. In this paper, motivated by recent advances in realization of real-time 
feedback control in circuit quantum electrodynamics systems, we study the effect of experimental 
imperfections on the optimality of qubit purification protocols. Specifically, we find that the op- 
timal control solutions in the presence of detector inefficiency and non-negligible decoherence can 
be significantly different from the known solutions to idealized dynamical models. In addition, wc 
present a simplified form of the verification theorem to examine the global optimality of a control 
protocol. 



1. INTRODUCTION 

Pure quantum states, states of systems with minimum 
classical uncertainty, are an ideal resource for many tasks 
in quantum information processing, including tclcporta- 
tion, quantum coding and error correction [1]. However, 
frequently the states of systems encountered in the lab- 
oratory are mixed quantum states that contain classical 
uncertainty about various aspects of the particular physi- 
cal system of interest. Ordinarily this uncertainty can be 
removed, and the state purified, by an appropriate mea- 
surement or cooling procedure. Such purification is often 
a first, and critical, step in many quantum information 
processing, communication, and metrology protocols. 

For many physical systems a measurement is properly 
treated as a finite timescale dynamical process as op- 
posed to an instantaneous projective operation. In such 
systems, the measurement timescale (the time taken to 
complete a measurement and collect enough information 
to distinguish between the possible classical outcomes) 
is long enough that one can perform operations on the 
system during the measurement process. Such measure- 
ments are referred to as weak measurements. Some ex- 
amples of quantum information systems that can operate 
in regimes of weak measurement are quantum dots mon- 
itored by quantum point contacts [2], and cavity QED 
implementations in optics [ ], or the solid-state [4]. 

In the case of purification implemented by a weak mea- 
surement, it is natural to ask whether it is possible to 
accelerate the purification process by performing oper- 
ations during the measurement. Jacobs showed in Ref. 
[5, 6] that in an ideal scenario it is possible to increase 
the instantaneous rate of purification by adding feedback 
operations that are unitary rotations conditioned on the 
information collected by the weak measurement thus far. 



Specifically, Jacobs showed that in the case of a single 
qubit with perfect efficiency measurement, no decoher- 
ence, and arbitrarily strong feedback (what we will call 
the ideal case) , a feedback strategy that always maintains 
a two-level quantum system (qubit) in an unbiased basis 
with respect to the constant measurement basis results 
a maximization of the instantaneous rate of purification. 
We shall refer to this protocol as the unbiased measure- 
ment protocol . Subsequently, Refs. [ ] generalized 
this result and showed that in the ideal case it is possi- 
ble to utilize feedback to increase the instantaneous rate 
of purification for arbitrary finite dimensional quantum 
systems. Wiseman and Ralph [10] have noted that it is 
useful to separate two different goals in the task of quan- 
tum state purification: the first goal, which we refer to as 
max purity, is that of maximizing the average purity of 
the system at a given time, while the second goal, which 
we refer to as min time, is that of minimizing the aver- 
age time taken to achieve a given purity. These authors 
show that Jacobs' unbiased measurement strategy (which 
consequently maximizes the instantaneous rate of purifi- 
cation) is advantageous for the max purity goal, while 
a diagonal measurement strategy, which measures in the 
diagonal basis of the qubit state (and requires no feed- 
back), is better for the min time goal. In fact, Wiseman 
and Bouten [11] later proved that in the ideal case the 
unbiased measurement strategy is the optimal one for 
purifying qubits with the max purity goal and that the 
diagonal measurement strategy is optimal for purifying 
qubits with the min time goal. This highlights another 
reason why quantum state purification is an important 
problem in quantum control theory. It is one of very few 
problems in this domain where questions of optimality 
can be constructively addressed. In contrast to this sit- 



It is implicit in the name that this protocol requires feedback to 
maintain the unbiased state. 
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uation for the state purification problem, the optimality 
of most quantum feedback protocols cannot be assessed 
in a constructive manner. 

All the above works address the problem of quantum 
state purification in the ideal case where the measure- 
ments are of unit efficiency (i.e., where the measurement- 
induced state disturbance is equally compensated by a 
gain in information about the state [ ] ) and the feedback 
action is arbitrarily fast. Both idealizing assumptions 
must typically be relaxed in realistic systems. In Ref. 
[13], Griffth et al. relax the arbitrarily fast feedback as- 
sumption and consider the performance of both the unbi- 
ased measurement and diagonal measurement protocols 
for purifying the state of a superconducting Cooper pair 
box qubit. More recently, Combes and Wiseman [14] 
have analyzed the impact of a wider array of imperfec- 
tions on the unbiased measurement protocol for purifi- 
cation, including Unite strength feedback, time delay in 
the feedback loop, calibration errors, measurement inef- 
ficiency, and decoherence. Both of these studies indicate 
that the acceleration of purification rate by feedback is 
severely hampered by practical constraints. 

In this work, we extend the study of quantum state pu- 
rification by studying the optimality of purification pro- 
tocols in the presence of key experimental imperfections. 
The imperfections we consider are measurement ineffi- 
ciency and extrinsic decoherence as a result of environ- 
mental noise. Both these imperfections will be present 
in most quantum information processing architectures. 
Hence it is important to consider their effects on purifi- 
cation, and also to formulate optimal strategies for pu- 
rification in their presence. We do not consider the im- 
perfections arising from finite strength feedback or time 
delay in the feedback loop, since it is much more difficult 
to analyze optimality in the presence of these features. 
The remainder of the paper is structured as follows. In 
section 2 we introduce the physical system and dynam- 
ics we will analyze. In section 3, we find the time local 
optimal strategy for purification in the presence of im- 
perfections. Section 4 discusses the global optimality of 
the locally optimal strategy for the max purity goal, us- 
ing a simplified form of the verification theorem (as part 
of our calculations we derive a simplified form of the ver- 
ification theorem [15] for verifying global optimality of 
control protocols, which is presented in appendix A). We 
also compare the global optimality of the local strategy 
formulated here against other known protocols. Then the 
global optimality of local strategy for the min time goal 
is discussed in section 5. A summary and conclusions are 
presented in section 6. 



2. DYNAMICS 

We restrict our attention to the case of a quantum two- 
level system (qubit). Although this is the simplest finite 
dimensional system, it is also the most relevant from a 
quantum information perspective since physical imple- 



mentations of qubits are the fundamental building blocks 
for most quantum information tasks. In addition, to ex- 
amine the effect of imperfections on state purification, it 
suffices to examine the case of a qubit. 

The system is subject to a weak, continuous measure- 
ment of an operator M with strength k and efficiency 
rj. In addition, we assume the qubit is coupled to a low 
temperature environment which induces relaxation and 
decoherence dynamics on the qubit. The master equa- 
tion describing the time evolution of a qubit with such 
dynamics is given by [12, 16] 

dp =jiD[<r-]pdt + -/ 4> V[a z ]p/2dt + V[V2kM]pdt 

+ H[^/2k~qM]pdW, (1) 

where p is the qubit density matrix, dW is a Wiener 
increment satisfying dW 2 — dt, and we have set h = 
1. The super-operators in this equation are defined 
as: V[A]p = ApA^ - \{A^ Ap + pA^A) and U\A\p = 
Ap + pA^ — Tr[(A + A')p\p. This equation is in a rotat- 
ing frame with respect to a free Hamiltonian of the form 
\uj{t)cj z , and M and p should be interpreted in this ro- 
tating frame. Here we have utilized the Born and Markov 
approximations of the noisy environment in order to sum- 
marize its effects on the qubit as Markovian dephasing 
at rate 74, and relaxation at rate 71. Since the environ- 
ment is considered to be at low temperature {ksT <C lj), 
we only consider its de-excitation (relaxation) effects on 
the qubit. The time-dependent measurement results, or 
measurement current, that generates conditioned evolu- 
tion by Eqn. (1) can be expressed as: 

I(t) = VkTv[(M + M+)p(t)] + m/VV- (2) 

where £(t) = dW/dt in a white noise process. In this pa- 
per, we assume that the measurement is along the com- 
putational basis axis, i.e. M = J z = a z /2 (note that this 
M has no time dependence in the rotating frame). 

Finally, we add a time-dependent coherent rotation of 
the qubit, F(t), that constitutes our feedback Hamilto- 
nian. F(t) could be based on the measurement results 
up to t and generates the following dynamics in addition 
to Eqn. (1) [12]: 

[p] fb = -i[F(t),p\. (3) 

We consider feedback of arbitrary strength for conve- 
nience, including infinite strength feedback which is mod- 
eled as instantaneous unitary rotations at any time su- 
perposed on the evolution given by Eqn. (1) 2 . 

Because of rotational invariance about the z axis, going 
away from the x—z plane does not aid purification. Thus, 



2 Protocols that require "infinite" feedback strength may be ap- 
proximated reasonably well in circuit QED [4] , where microwave 
rotations are significantly faster than other relevant timescales, 
namely, \F(t)\ > fc,7i,7^. 
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without loss of generality, we may restrict our attention 
to the Bloch vector components x and z, and consider 
the feedback rotation to be about the y axis. 

Without feedback, the evolution of the Bloch vector 
components of the qubit (p = 4(1 + xa x + ya y + za z )) 
is: 

dx = — (72 + k)xdt — ^2krjxzdW, (4a) 
dz = -(71 + jiz)dt + \/2k~q(l - z 2 )dW, (4b) 

where 72 = 7i/2 + 70 and r = v 1 x 2 + z 2 . 

Using Ito's lemma [ •], we can translate Eqn. (4) 
into the following dynamic equation for the variable 
r = \fx 2 + z 2 , the length of the Bloch vector: 

dr =[r(72 — 71) + k(r — — )}u 2 dt — ^ludt 
r 

+ [fc(" -r)- l2 r]dt+ Jzkq{l - r 2 )udW, (5) 
r 

with u = -. 3 Note that for u = ±1 we have z = ±r 

r 

and the state lies on the z-axis. For — 1 < u < 1, the 
Bloch vector makes a non-zero angle with the z axis. 
The feedback control (which is a unitary rotation) does 
not affect the above dynamical equation for r, but it does 
affect the dynamical equation for u (which is not shown 
here). However, the assumption of arbitrary strength 
feedback simplifies the treatment since it implies that we 
can set u arbitrarily by instantaneous rotation at any 
time. Therefore, we identify r as our state variable and 
u(r,t) e [—1)1] as our control input in the above dy- 
namics. Some of the actual controls we consider for vari- 
ous optimality conditions below require infinite strength 
feedback while others do not, and we will make this re- 
quirement explicit when relevant. 

Finally, we also write down the special case of Eqn. (5) 
with no decoherence for later convenience. That is, when 

7i = 14> = °> 

dr = k(r - -){u 2 - l)dt + y/2k~n(l - r 2 )udW. (6) 



3. THE LOCALLY OPTIMAL STRATEGY 

In this section, we will formulate a locally optimal 
strategy that maximizes the instantaneous rate of purifi- 
cation in the presence of measurement inefficiency and 
decoherence. This strategy is the generalization of the 



3 The apparent singularity at r = can be removed by chang- 
ing the state variable from r to the purity, P, as we will see in 
Eqn. (7). However, because the qubit dynamics are more readily 
visualized in terms of the Bloch vector r, and expressions look 
simpler in r, we will use r as our state variable for the greater 
part of the presentation in this paper (in situations where the 
singularity does not affect our results). Numerical calculations 
of Eqn. (5) are handled by setting u = —1 at the origin to avoid 
the singularity. 



unbiased measurement protocol [5, G] which is also a lo- 
cally optimal strategy, but was also shown to be globally 
optimal for purifying qubits with the max purity goal, 
in the ideal case with no imperfections [11]. In the follow- 
ing sections, we will analyze whether this locally optimal 
strategy is also globally optimal for any goal in the pres- 
ence of imperfections. 

We begin by writing an equation of motion for the 
purity, which is defined as: P = Tr[/? 2 ] = |(1 + r 2 ). 
Using Ito's Lemma and Eqn. (5), we obtain the rate of 
change of purity as: 

dP =[72 - 7i + k(l -2rj + nr 2 )]r 2 u 2 dt - ^rudt 

+ [krj - (72 + k)r 2 ]dt + y / 2fc?7(l - r 2 )rudW. (7) 

This equation consists of a deterministic quantity and a 
stochastic quantity that is proportional to dW(t). The 
former gives the rate of change of average purity, (P), 
since dW(t) averages to zero. Here the angle brackets 
indicates an average over the stochastic noise processes. 
Notice if write r in terms of P on the right hand side, 
r = V2P — 1, this equation can be regarded as the dy- 
namics for the state variable P. Maximizing the instan- 
taneous average purification rate, (P) by choice of rota- 
tions around the y axis is equivalent to maximizing the 
following quadratic function of it: 

f{u) = [72 - 71 + k(l - 27/ + r]r 2 )}r 2 u 2 - jiru. (8) 

The control that maximizes this function can be consid- 
ered a locally optimal strategy since it maximizes the 
instantaneous rate of change in average purity, and we 
label it ui Q (t). It is not a priori clear that such a locally 
optimal strategy will be globally optimal for either the 
max purity or min time purification goals, and we will 
investigate this issue in sections 4 and 5 below. 

3.1. Local optimality in the absence of decoherence 

Consider 71 = 7^ = 0, in which case, f(u) — > 
/no-docohorcnco(u) = k(l - 2r) + r]r 2 )r 2 u 2 . The maximizers 
of this function are easily found and are summarized in 
Table I. Interestingly, the 1/2 < 77 < 1 case introduces 
a fragmentation of the locally optimal control strategy 
that is not present in the ideal case (77 = 1) 4 . Also, Ta- 
ble I shows that if r\ < 1/2, the locally optimal strategy 
is simply to measure diagonally by keeping the state of 
the qubit in the a z basis, i.e. \u(t)\ = 1. The simplest 
way to implement this diagonal measurement strategy is 
to perform an initial instantaneous rotation to the z axis 
(since control is assumed to be instantaneous and at no 
cost) and no successive feedback, and we will call this 



4 We note that Combes and Wiseman have previously suggested 
that such a fragmented, or switching, strategy might be optimal 
in their study of purification under imperfections [14]. 
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the no-feedback diagonal measurement protocol. On the 
other hand, when 77 = 1, which is the ideal case that was 
analyzed in Ref. [5] , we recover the unbiased measure- 
ment protocol as the locally optimal strategy: u(t) = 0. 
This protocol keeps the qubit in an unbiased basis with 
respect to the measurement, and u(t) = =>■ z(t) = 
is maintained by strong rotations. 

In the intermediate case where 1/2 < r\ < 1, a criti- 
cal Bloch vector length emerges, r* = y2 — I/ 7 ?! around 
which the locally optimal strategy switches between the 
diagonal measurement protocol and the unbiased mea- 
surement protocol. That is, when r < r* the feedback 
induced control u — maximizes the rate of change of 
average purity, while when r > r* , the strategy of di- 
agonal measurement maximizes this quantity. We note 
that there is a critical purity corresponding to the critical 
Bloch vector length, given simply by: P* = |(1 + r* 2 ) = 
1(3-1/77). 

A note is in order about the feedback nature of this 
locally optimal strategy. Obviously \u(t)\ = 1 requires 
no feedback since this corresponds to constant rate mea- 
surement along a fixed axis (er z ) and F(t) = 0. u(t) = 
on the other hand requires maintaining the Bloch vector 
along the :r-axis despite measurement-induced fluctua- 
tions causing deviations from this axis. To do this, as 
specified in Jacobs' original unbiased measurement pro- 
tocol [5, 6, 14] the feedback Hamiltonian must be propor- 
tional to the measurement current: F(t) — V2kn^^Jy. 
This is a conditioned rotation since it is inversely pro- 
portional to x(t), the x projection of the Bloch vector 
at the current time instant 5 . However, it should be 
noted that this protocol does not require real-time state 
estimation to execute. This is because in the presence 
of the feedback, when u(t) = 0, the evolution of the x 
component of the Bloch vector is deterministic, since the 
feedback effectively cancels the stochastic component of 
the evolution. In contrast, the locally optimal strategy 
when 1/2 < 77 < 1 requires a switch between the diag- 
onal measurement protocol and the unbiased measure- 
ment protocol when r crosses r* . In order to implement 
this, one requires a real-time estimate of the length of 
the Bloch vector, r(t), which does not evolve determin- 
istically when u 7^ as can be seen from Eqn. (5). Fur- 
thermore, this optimal strategy requires one to rotate the 
state between the x and z axes as the Bloch vector length 
crosses r* . More precisely, for r < r* , the feedback pre- 
scribed by the unbiased measurement protocol maintains 
the state on the x-axis (which is unbiased with respect to 
the measurement along z-axis). When the Bloch vector 
length increases to r > r* , the locally optimal strategy 



5 Technically, this F(t) is an unbounded Hamiltonian since dW(t) 
is unbounded, and furthermore, the initial state x(0) = 0. How- 
ever, it has been shown that tempered approximations of this 
Hamiltonian suffice to implement the unbiased measurement pro- 
tocol [14]. 



prescribes a fast rotation of the state from the x-axis to 
the z-axis (a ir/2 a y rotation) followed by no feedback 
(unless r < r* again at a late time due to the stochas- 
tic evolution of purity under diagonal measurement mea- 
surement). These operations require real-time state esti- 
mation in addition to rotations that take negligible time. 
Such requirements make implementation of the locally 
optimal strategy challenging when 1/2 < 7/ < 1. 

TABLE I: The strategy that maximizes the instantaneous rate 
of increase of average purity (the locally optimal strategy) 
when 71 = 70 = 0. r* = \Jl — I/77 is the critical Bloch vector 
length at which there is a discontinuous change in protocols 
when 1/2 < 77 < 1. 





77 < 1/2 


1/2 < 77 < 1 


77=1 


ui (r,t) 


±1 


J , r<r* 
\±1 , r > r* 






To illustrate the behavior of purification in the case 
when 1/2 < 77 < 1, Figure 1 shows the average rate of 
change of purity as a function of the instantaneous pu- 
rity under both the unbiased measurement protocol and 
the diagonal measurement protocol. The rate of purifi- 
cation decreases with the instantaneous purity for both 
protocols, but while this rate is always positive for the 
diagonal measurement protocol, it can be negative for 
the unbiased measurement protocol when the purity is 
large. This is the reason it is advantageous to switch to 
the diagonal measurement protocol at large purity val- 
ues. Physically, the reason for this switch is that the 
feedback required for the unbiased measurement proto- 
col is non-ideal for inefficient measurement, and hence 
for Bloch vectors that are already large it is preferential 
to switch off the non-ideal feedback. 



3.1.1. Analytical solution for purity evolution when 
1/2 < 77 < 1 

When 7/ < 1/2, the locally optimal strategy is simply 
measurement along a fixed basis, and in this case an an- 
alytical form for the probability distribution for r, the 
length of the Bloch vector, is easily computed and given 
in section 4.1 below (since the evolution is simply diffu- 
sion along the z-axis after a possible initial rotation to 
move the state to this axis). Similarly, when 77 = 1, when 
the locally optimal strategy is the unbiased measurement 
protocol, an expression for the probability distribution 
for r is given in Ref. [5] (not in closed form however). 
Here we complete this characterization and calculate an 
analytical expression for the probability evolution for r 
in the case of qubit evolution under the locally optimal 
strategy when 1/2 < 77 < 1. Note that this case is signif- 
icantly more complicated than the other two mentioned 
above since it involves a switching of protocols around 
the point r* . 
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FIG. 1: Average rate of change of purity as a function of 
instantaneous purity in the presence of measurement ineffi- 
ciency, for the diagonal measurement protocol (\u(r,t)\ = 1) 
and the unbiased measurement protocol (u(r,t) — 0). The 
point where the locally optimal strategy switched between 
these protocols is indicated as P*. Parameters used for this 
plot are k = 1, 7] = 0.8, ji = 70 = 0. 



Consider a known initial state of the qubit on the z-axis 
with r(0) = r (e.g. r Q — when p Q — 1/2). The prob- 
ability distribution function p(r, t) of the Bloch vector 
length at time t is given by the following Fokker-Planck 
equation with the initial condition p(r, 0) = 6(r — ro): 

£p(r,t) = - - r)0(r* - r)p(r,t)} 

at or r 

+ -f^{kr i (l~r 2 ) 2 8(r-r*)p(r,t)}, (9) 



conditions p 2 (r, 0) = 5{r — ro) and pi(0) = 0: 

r V {l~r 2 ) 2 p 2 (r*,t) = ( V - r 2 ) Pl (t), (11a) 

~[kr ] (l-r 2 ) 2 p 2 (r,t)]\r' = ~Pi(t), (Hb) 

~P2(r,t) = ^[kr ] (l-r 2 ) 2 p 2 (r,t)]. (11c) 

Note that in the region r > r* Eqn. (ll)c is the Fokker- 
Planck equation. Eqn. (ll)b enforces probability conser- 
vation at the boundary r = r* . This set of equations 
can be solved by an appropriate change of variables: we 
calculate the solution explicitly in Appendix C. This an- 
alytical solution will be utilized when assessing global 
optimality in the section below. 

Now consider the alternative initial condition where 
r < r* . In this case, the locally optimal strategy im- 
plements the unbiased measurement protocol, which will 
deterministically increase the purity of the qubit accord- 
ing to: 

r(t) = ^Jrj- (r/-r<;)e- 2fet (12) 

This continues until the critical Bloch vector length is 
reached and the locally optimal strategy then switches 
over to the diagonal measurement protocol. Let r(t*) — 
r* . Then, after t* , the non-deterministic diagonal mea- 
surement protocol is employed and we simply solve 
for the qubit dynamics that this prescribes, i.e., solve 
Eqn. (11) for the distribution function with the initial 
condition p 2 (r,t*) = S(r — r*). 

Together with the explicit characterizations of evolu- 
tion under the locally optimal strategies when rj < 1/2 
and 77 = 1 described above, this analytic characteriza- 
tion of the probability distribution for the Bloch vector 
length now provides a complete picture of the evolution 
of qubit purity under the locally optimal strategy for any 
measurement efficiency. 



where 6{x) is the Heaviside step function. 

First consider ro > r* . In this case the locally opti- 
mal strategy implements the diagonal measurement un- 
less the random (diffusive) evolution results in r(t) < r* 
at some future time t. But when this happens the strat- 
egy switches to the unbiased measurement protocol which 
deterministically increases purity until r(t+At) > r* and 
we are returned to the region where the diagonal mea- 
surement protocol is preferred. Therefore, for this initial 
condition r will not go below r* for a finite time. Hence 
we consider the following ansatz for the probability dis- 
tribution function for r: 

p(r,t) = Pl (t)5{r-r*)+p 2 {r,t)6(r-r*). (10) 

Given this ansatz, the Fokker-Planck equation can be 
translated into the following set of equations with initial 



3.2. Local optimality in the presence of 
decoherence 

In this subsection, we expand our analysis of locally 
optimal strategies that maximize the instantaneous rate 
of increase of average purity to the case where decoher- 
ence is present. Recall that the general expression for 
instantaneous average purification rate, which we want 
to maximize, is given in Eqn. (8). For general 71, 7^, 
and 77, in order to maximize f(u) we require: 

ui t rt \-\ _1 if 71 + 2r l72 - 7i + k(l -2tj + r?r 2 )] > 
Ul ° ~ I 2r h2 - 11+ k(i^ v+ ^)] otherwise 

For the explicit calculations in this work, we will focus 
on the most realistic situation, where 71 7^ 0, 7^ 7^ 0, 
and 77 < 1. 
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Here, the counterpart of the regime where 77 < 1/2 
in the previous case of no decoherence is the parameter 
space that yields ui (r, t) = — 1. A sufficient condition for 
this is 70 + k(l — 2-q) > 0. In the presence of decoherence, 
the term —^\ru term in f(u) (see Eqn. (8)) ensures that 
u = +1 is no longer a maximizing control. Physically, 
this simply means that because of uncontrollable relax- 
ation (a T\ process), it is not advantageous to attempt to 
purify to the excited state of the qubit. Instead, main- 
taining u — — 1 ==> z < is the locally optimal strategy. 
Now to keep the z-component of the qubit negative, we 
must flip the qubit (e.g., apply tt rotation around y axis) 
whenever our o~ z measurement indicates that it is posi- 
tive. Therefore this control corresponds to simply having 
a constant a z measurement (which induces diffusive mo- 
tion along the z axis) that is interrupted by strong o~ y 
rotations whenever z(t) > 0. Such a feedback protocol 
requires continuous real-time state estimation. We shall 
refer to this strategy of maintaining the z-component of 
the qubit negative as the negative diagonal measurement 
protocol. 

The negative diagonal measurement protocol is ex- 
pected to have better performance than the no-feedback 
diagonal measurement protocol 6 both locally and glob- 
ally, as long as 71 > 0. The a z measurement has a 
chance of moving the qubit to the z > region and when 
this happens, the strong feedback rotation in the nega- 
tive diagonal measurement protocol will change the term 
— 71 z in Eqn. (7) to +712:, while keeping all others invari- 
ant. This will generate a larger instantaneous purifica- 
tion rate. Therefore, for any noise realization dW(t), the 
negative diagonal measurement protocol gives a trajec- 
tory that has a purity larger than or equal to evolution 
under the no-feedback diagonal protocol at any time. We 
will see this manifest in the numerical simulations assess- 
ing global optimality in sections 4 and 5. 

In the parameter space where u; (r, t) ^ 1, it/ (r, t) 
has complicated dependence on r. We have plotted its 
value for a typical set of parameters in Fig. 4(a). When 
the condition 71 + 2r[j2 — 7i + k(l — 2r\ + rjr 2 )} > is 
not met, one must perform precise rotations around the 
y axis so that u = tt-, -rSk — n~z r — irr is maintained. 

a 2r[7 2 — 7i+fc(l — 2T)+r\r i )\ 

This also requires continuous real-time state estimation, 
and furthermore, requires precise knowledge of all the 
parameters in the system. Executing this locally opti- 
mal strategy in the presence of decoherence is hence very 
challenging from a practical standpoint. 

The locally optimal strategy in the presence of deco- 
herence is significantly more complex than that in the 
absence of decoherence. As a result we have been unable 
to formulate an analytical solution for the probability 



While the no-feedback diagonal measurement may come with an 
initial rotation to either +2 or —2 axis in the decoherence free 
regime, we will choose the more advantageous initial rotation to 
the —2 axis here. 



distribution of the Bloch vector length in this case. 



4. GLOBAL OPTIMALITY FOR THE max purity 
GOAL 

In this section, we will consider the max purity purifi- 
cation goal, i.e., to maximize P(t) = 2 + 5/0 drr 2 p(r,t) 
with a fixed purification time t, and ask whether the lo- 
cally optimal strategies formulated in the previous sec- 
tions are globally optimal for this goal. We note that 
in the ideal case (no decoherence and measurement ef- 
ficiency T) — 1) global optimality of the locally optimal 
strategy (i.e., the unbiased measurement protocol when 
•q = 1,71 — 70 = 0) was proven in Ref. [ ]. In the 
following subsections we investigate the extent of global 
optimality in other parameter regimes. 



4.1. No decoherence and n < 1/2 

In this regime, we found the locally optimal control to 
be u(r,t) = ±1, which is the diagonal measurement pro- 
tocol. Here we use the verification theorem to prove that 
the locally optimal solution is actually globally optimal. 
Appendix A reviews the verification theorem [15] and 
provides a simplified form that is useful for the present 
calculations. The verification theorem provides a suffi- 
cient set of criteria to test the global optimality of a pre- 
sumed solution. We will use P instead of r as the state 
variable in this subsection to avoid complications deriv- 
ing from the r = singularity in the dynamical equation 
for r (see Eqn. (5)). At the first step, we need to calculate 
a cost function C(P,t), which is defined as the average 
impurity (L(T)) = 1 - (P(T)) at time T, given that the 
purity is P at time t. For an arbitrary initial state, the 
no-feedback diagonal measurement protocol specifies an 
initial rotation of the qubit to the z-axis and then simple 
measurement in the z-basis (no feedback). Since the ini- 
tial rotation is assumed to be instantaneous, the dynam- 
ics under this protocol is the subsequent motion along the 
z axis. This density matrix evolution under measurement 
alone can be solved with the method of linear quantum 
trajectories [17]. Given z(0 + ) = zo (the + time simply 
indicates the time after the instantaneous rotation to the 
x-axis), z(t) can be written as: 

z(R(t)) = tanh(arctanh(z ) + y/2kr)R(t)), (13) 

where R(t) is a random variable whose distribution func- 
tion at time t is given by: 

• cosh(arctanh(2;o) + y/2krjR). (14) 
We find the cost function C(P, t) using this distribution 
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as 



C{P,t) 



VMT - 1) 

sech [ar ct anh ( y2P — 1 

i? 2 



+00 



• exp[- 



2(T-t) 



\dR. 




The G function for our dynamics (7), defined in ap- 
pendix A, is related to the derivatives of the cost function 
by: 



G(P,t) =-fc(2P-l) 

• MP - !) : 



, d 2 c 

dP 2 



(1-3^ + 27^)—] 



fc(l + 77- 2P) 



dC 
dP' 



(16) 



The derivatives and are continuous over the in- 
terval [0, T) as required by the verification theorem. The 
cost function in (15) gives the G function a nonnegative 
coefficient in front of u 2 (including the minus sign) for all 
P and t. The explicit form of this coefficient is derived 
in Appendix B. Therefore, u(P,t) = ±1 are the maxi- 
mizers of the G function, and the verification procedure 
concludes that the corresponding diagonal measurement 
protocol is globally optimal in this parameter regime. 



4.2. No decoherence and 1/2 < 77 < 1 

In the regime 1/2 < 7/ < 1, the locally optimal strat- 
egy combines the diagonal measurement protocol and the 
unbiased measurement protocol, with a switch between 
these at a critical Bloch vector length r* . In this case, 
however, one can show that this locally optimal strategy 
is not globally optimal. To do so, we can solve for aver- 
age purity as a function of time and compare it against 
the corresponding purity derived from other protocols. 
In section 3.1.1 we obtained an analytical form for the 
probability distribution for the Bloch vector length as 
a function of time, p(r,t), when using the locally optial 
strategy. With this distribution function, the average 
purity, (P(t;)), can be calculated by a simple integral. 

Figures 2(a) and 2(b) show how the average purity 
evolves as a function of time (for three different control 
strategies) in the absence of decoherence, with parame- 
ters chosen so that ro < r* and ro > r* , respectively. 
We have tested a wide range of parameter values and the 
results are qualitatively the same throughout this regime 
of measurement efficiency (1/2 < 77 < 1). With r < r* , 
the locally optimal strategy initially outperforms the di- 
agonal measurement protocol as expected. However, the 
diagonal measurement protocol (i.e. constant <r z mea- 
surement) catches up later and purifies more effectively 
at late times. The catch up time occurs before the Bloch 
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FIG. 2: Average purity vs. time in the case of no decoherence 
for the unbiased measurement protocol, the locally optimal 
strategy, and the diagonal measurement protocol for two dif- 
ferent initial purities. The parameters used here are k — 1, 
77 = 0.84 (r* = 0.9), 71 = 70 = 0. 



vector length reaches r* using the locally optimal strat- 
egy. When ro > r* , Fig. 2(b), the locally optimal strat- 
egy never outperforms the diagonal measurement proto- 
col 7 . 



7 For very short times the difference between purities for the two 



The fact that there is a performance difference between 
the locally optimal strategy and the diagonal measure- 
ment protocol when r > r* needs some explanation. 
When ro > r* , initially the locally optimal strategy is 
exactly the diagonal measurement protocol. However, as 
time progresses, while the diagonal measurement proto- 
col simply causes diffusion of r(t) along the z-axis, the 
locally optimal strategy switches to the unbiased mea- 
surement protocol if r(t) drops below r* at any future 
time. The subsequent deterministic increase of purity 
caused by the unbiased measurement protocol results in 
r(t + At) > r* again for some small At. Then the switch 
back to the diagonal measurement protocol causes diffu- 
sion of r(t) again. The net result of this switching back- 
and-forth at the boundary is a build-up of probability at 
r = r* at intermediate times. This concentration of prob- 
ability weight at r = r* gives rise to a smaller cumulative 
probability in the r > r* region than the diagonal mea- 
surement protocol. This is illustrated in Figure 3, which 
shows the time-development of the probability distribu- 
tion, p(r,t), when ro > r* for both the locally optimal 
strategy and the diagonal measurement protocol. This 
probability concentration at the boundary is an interest- 
ing consequence of the switching behavior of the locally 
optimal strategy. This example demonstrates that while 
protocol switching can lead to local optimality, it can be 
detrimental to global optimality in some instances. 

We also note that the unbiased measurement protocol 
eventually performs worse than both of the other pro- 
tocols (diagonal measurement and the locally optimal 
strategy), regardless of the initial state. This demon- 
strates that using the optimal strategy derived for per- 
fect efficiency measurements rj = 1 can be inappropriate 
if actually the measurement efficiency is less than unity. 

From the above analysis we conclude that for the max 
purity goal, the locally optimal strategy, which maxi- 
mizes the rate of increase of average purity at any time 
instant, is also globally optimal for both rj < 1/2 and 
T) = 1. Therefore, feedback control is not helpful at all 
for the max purity goal when rj < 1/2. Instead, the lo- 
cally optimal strategy in this regime is simple measure- 
ment. In contrast, in the regime 1/2 < rj < I, we cannot 
conclude that any strategy is globally optimal. 



4.3. In the presence of decoherence 

We have not been able to apply the verification the- 
orem to prove global optimality of the locally opti- 
mal strategy for the max purity goal in any parame- 
ter regime in the presence of decoherence. However, we 
speculate that in the parameter regime where the neg- 
ative diagonal measurement protocol is the locally opti- 
mal strategy (this regime is specified by the inequality 



protocols is within numerical error. 



7i + 2r[j 2 - 7i + fc(l - 2r/ + rjr 2 )] > 0,Vr), that it is 
also the global optimum. This is because in this regime 
it is advantageous to cooperate with the relaxation pro- 
cess, which also induces purification, and this is precisely 
what the negative diagonal measurement protocol does. 
When the locally optimal strategy has a more compli- 
cated r dependence (e.g., that in Fig. 4(a)), we numeri- 
cally simulate the performance of the four different pro- 
tocols we considered so far (using the Euler-Maruyama 
algorithm [IN] for those protocols involving stochastic in- 
tegration). Fig. 4(b) shows the behavior of average pu- 
rity as a function of time for a typical set of parameters. 
The performance of the free evolution (with no measure- 
ment at all) is also added for comparison, since the re- 
laxation dynamics itself induces purification. The neg- 
ative diagonal measurement protocol is superior to the 
no-feedback diagonal measurement protocol as expected. 
It also achieves greater values of purity than the locally 
optimal strategy in the long run, similar to the situation 
found in the decoherence free case. The locally opti- 
mal strategy is also superior to the no-feedback diagonal 
measurement strategy 8 . The above facts suggest that, 
in the presence of decoherence, qubit purification mea- 
sured by the max purity goal will always benefit from 
feedback, even though in the absence of decoherence the 
corresponding preferred purification strategy may be no 
feedback (at long times). 



5. GLOBAL OPTIMALITY FOR THE min time 
GOAL 

In this section, we consider the min time goal, i.e., 
to minimize — J tdq(t), where q(t) is the probability of 
not reaching a fixed target Bloch vector length at time t. 
We will determine whether the locally optimal strategies 
identified above are globally optimal in any parameter 
regime for this goal 9 . 



5.1. In the absence of decoherence 

For the min time purification goal in the absence of 
decoherence, Wiseman and Bouten proved that the di- 
agonal measurement protocol is globally optimal for per- 
fectly efficient measurements, rj = I [II]. We show here 
that in the absence of decoherence this is true for all 
values of measurement efficiency 77 using the verification 
theorem. In this subsection, we will again use P as the 
state variable, and r should be interpreted as a function 
of P, namely, r = \J2P — I . 



8 At least when the decoherence rates are not too small 

9 We note that a convenient feature of the min time goal is that its 
evaluation does not depend on the definition of purity used; any 
definition of purity as an increasing function of r is equivalent. 
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FIG. 3: Behavior of the solution to the Fokker-Planck equation, Eqn. (9) at three different times t, visualized by the distribution 
of s(t) = arctanh(r(t)) as the qubit evolves under the locally optimal and the diagonal measurement protocols. Both protocols 
operate on a qubit initially at the same purity, with ro = 0.95. Parameters used are the same as in Fig. 2, namely k = 1, 
r] = 0.84 (r* = 0.9), and 71 = 7^ = 0. 



Let Tj be the final Bloch vector length that we want 
to achieve. The cost function C(P,t) for this goal is the 
average remaining time of the first passage through rf 
(rf < 1), given that the qubit is at r — y/2P — 1 (r < 
rf) at time t. We took the approach given in Ref. [10] 
to calculate this min time cost, and solved the Fokker- 
Planck equation with absorbing boundary conditions at 
r = Tf. If p(r,t) is the solution of this equation, then 



the probability of not hitting the target rf by time t 
is q(t) = J_ r p(r, t)dr. The average time of reaching 
the boundary is then obtained from q(t) by integration, 
namely — J °° tq(t)dt = J °° q(t)dt. 



After the initial rotation to the z-axis that the diago- 
nal measurement protocol prescribes, z(0 + ) = r. Then 
C(P, t) can be obtained as described above, and takes 
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FIG. 4: Performance comparison of the free evolution, the locally optimal strategy, the no-feedback diagonal measurement, the 
unbiased measurement, and the negative diagonal measurement protocols in the presence of decoherence for initial condition 
po = 1/2 (ro = 0). The chosen parameters are k = 1, 7] = 0.91, 71 = 0.2, 72 = 0.3. (a) plots the locally optimal strategy for 
this chosen set of parameters. Notice the switch from the preferred negative diagonal measurement control at small and large r 
values (ui (r,t) = — 1) to a more complex preferred control (ui (r,t) > — 1) for intermediate values of r. (b) shows the average 
purity (over 40,000 runs) vs. time and (c) shows the average purification time (over 20,000 runs) as a function of the target 
Bloch vector length for the five protocols. The statistical variation in the simulations is comparable to the line-width of the 
plots, so error bars are not explicitly shown. 



the form 

C(P,t) — (r/ X arctanh(r/) — r X arctanh(r))/2fej7. (17) 

Substituting this form into the G function, Eq. (16), 
results in the following expression for the coefficient of 



,2. 



1 



1 r 

2\ 



1 



)(arctanhr 



1 



(18) 



where r <E [0, rf]. 

It can be shown that Eq. (18) is nonnegative if 77 = 1. 
Since it is a decreasing function of ?y, the expression is also 
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nonnegative for all rj. The diagonal measurement proto- 
col thus maximizes the function G, and by the verification 
theorem (appendix A) it is globally optimal. Therefore 
in the absence of decoherence, feedback is not beneficial 
for the min time goal, regardless of the efficiency of mea- 
surement. 



5.2. In the presence of decoherence 

In the presence of decoherence we cannot prove global 
optimality of any of the protocols, but we do have strong 
numerical evidence suggesting that the negative diago- 
nal protocol is globally optimal for the min time goal 
in the presence of relaxation. We numerically simulated 
the qubit purification under the locally optimal, the no- 
feedback diagonal, unbiased measurement, and the neg- 
ative diagonal measurement protocols with the Euler- 
Maruyama algorithm for many combinations of param- 
eters. In Figure 4(c) we show results for one particular 
parameter combination, but qualitatively similar results 
are obtained for all parameter regimes simulated. We 
find that the negative diagonal measurement protocol is 
always superior to the other alternatives, particularly the 
locally optimal one and the no-feedback diagonal one. 
Therefore, we speculate that the negative diagonal mea- 
surement protocol constitutes the globally optimal strat- 
egy for the min time problem, regardless of decoherence 
rates or measurement efficiency. We also conclude that 
feedback is likely advantageous for the min time goal in 
the presence of decoherence. 



6. CONCLUSIONS 

Optimal feedback control can provide a crucial ele- 
ment of precision control in solid-state quantum systems, 
where the measurement process typically requires a non- 
negligible time to complete. In this work, we studied the 
optimality of feedback control protocols for qubit purifi- 
cation in the presence of decoherence and with realistic 
detectors characterized by non- ideal efficiency {rj < 1). 
We considered the two control goals: (i) maximizing the 
average purity at a target time (max purity) and (ii) 
minimizing the average time to reach a specific purity 
threshold (min time). 

When environmental decoherence is negligible and the 
only source of decoherence is the measurement back- 
action, we arrived at the following conclusions. For the 
max purity goal and detector efficiency less than 1, we 
found that the globally optimal protocol is significantly 
different from the unbiased measurement protocol that 
is known to be globally optimal for this goal when r\ = 1 
[11]. This underscores the fact that one should be care- 
ful when extrapolating the optimality of feedback control 
protocols from idealized to realistic scenarios. The diag- 
onal measurement protocol, which is an initial rotation 
to the z-axis followed by measurement in the z basis, is 



optimal when rj < 1/2, as analytically verified by the ver- 
ification theorem. The diagonal measurement protocol 
also performs very well when 1/2 < rj < 1, where indeed 
it outperforms the locally optimal protocol in the long 
time limit. We were however unable to find the global 
optimal solution in this regime. In contrast, the situa- 
tion for the min time purification goal is quite different. 
Here the diagonal measurement protocol is known to be 
(globally) optimal solution for an ideal detector [11]. In 
this work we showed that this optimality under the most 
ideal conditions holds for all values of detector efficiency 
rj, as long as no decoherence is present. 

We then explored the effects of decoherence on the op- 
timal feedback strategies in addition to non- ideal (77 < 1) 
detectors. The decoherence sources were modeled by 
independent dephasing and relaxation processes under 
Markovian conditions. Here extensive numerical sim- 
ulations show that the negative diagonal measurement 
protocol, which is designed to maintain the qubit in the 
negative segment of the z axis by strong feedback, out- 
performs the other three feedback strategies (unbiased 
measurement, no-feedback diagonal, and locally optimal) 
for the min time goal, and for the max purity goal in 
the long time limit. Similar to the decoherence free case, 
there is a regime where the negative diagonal measure- 
ment protocol is not locally optimal. (It typically hap- 
pens when the measurement efficiency is high and de- 
coherence is weak.) Nevertheless, the negative diago- 
nal measurement protocol achieves better average purity 
than the locally optimal one after certain time. The neg- 
ative diagonal measurement protocol's good performance 
for both problems strongly suggests that feedback is use- 
ful in the presence of decoherence. 

An interesting aspect of the study presented here is 
the behavior of the locally optimal protocol when 1/2 < 
77 < 1, and in the absence of decoherence. This protocol 
involves switching between two strategies, unbiased mea- 
surement and diagonal measurement, which corresponds 
to a switching between regions where the purity increases 
ballistically and diffusively, respectively. We demon- 
strated that this switching behavior results in a concen- 
tration of probability (of purity) around the boundary 
that defines the switching behavior. This novel aspect 
results directly from the dynamic switching of protocols 
and to the best of our knowledge has not been explored 
in other quantum control contexts. 

Overall, this work extends prior optimal control results 
in the quantum realm to include realistic experimental 
conditions and shows that significant modifications of op- 
timal feedback control strategies can arise in the presence 
of decoherence. In future it will be interesting to further 
extend these studies to analysis for qubits coupled to non- 
Markovian environments, to determine how the detailed 
behavior of an environment may enter the optimal con- 
trol strategy. 
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Appendix A: The Verification Theorem 

In this appendix, we review the procedure for verifying 
the (global) optimality of a given solution to a stochastic 
control problem. An introduction to this topic can be 
found in Rcf. [ ]. Consider the general dynamical equa- 
tion for a stochastic system 

efoc = A(t, x, u(x, t))dt + B(t, x, u(x, t))dW. (Al) 

Here the state of the system is given by the vector x, 
and the vector u(x, t)) denotes the control inputs. (The 
region that bounds u(x, i)) shall not depend on x or t.) 
The vectors A and B are coefficients of the deterministic 
and stochastic parts of the dynamics, respectively. 
The control objective is to minimize a cost, J: 



J 



L(x, u(x, s),s)ds + M(x(T)) 



(A2) 



where L(x, u(x, t), t) is the cost rate, usually the con- 
sumed energy penalty, and M(x(T)) is the cost of the 
final state at time T. 

The cost function, C(x, i), is defined as the partial cost 
over the interval [t, T] , given that the system is at state 
x at time t: 



C(x,t) 



Lds + M(x(T)) 



(A3) 



To determine whether a given control protocol, u c (x, t) 
is optimal, one performs the following three steps: 

1. Integrate the equations of motion of the system to 
calculate the cost function, C(x, t), for this protocol. 

2. Check that C satisfies two continuity conditions: 



dC 



and 



d 2 C 
dx 2 



(A4) 



are continuous. Here d 2 C/dx 2 denotes the matrix of sec- 
ond derivatives of C. 

3. Determine whether or not v(x, t) = u c (x,t) is a 
maximizer of the following function of v: 



G(i,x,v)=-^Tr 



d 2 C 

B t (t,x,v)^B(i,x,v) 



(A5) 



Note that one must check that u c (x, t) maximizes G 
separately at each time t and at each value of x. 

The above verification procedure provides a sufficient 
condition for a control strategy to be optimal. In Rcf. [ ] , 
the procedure has four steps, and we have removed the 
third step by realizing that the Hamilton- Jacobi-Bcllman 
equation is automatically satisfied if the u c maximizes 
Eq. (A5). 

For a time-optimal control problem where the goal 
is to minimizing the average time taken for a function 
h(x(t),t) of the dynamical variables (and perhaps of 
time) to cross a fixed threshold h c , the same three-step 
verification procedure still holds. Please note that the 
cost function C(x, t) should be defined as the average re- 
maining time it will take to cross the threshold, given 
that the current time is t and current state is x. The 
corresponding function G in step 3 is defined as 



G(t,x,v) = -i'& 



Bt(t,x,v)^-B(t,x,v 



dC 
dx 
(A6) 



Appendix B: The Coefficient of the u 2 Term 

In this appendix, we will prove that the coefficient of 
the u 2 term in Equation 16 is nonnegative for all P G 
r? € [0,0.5], k > 0, and T-t > with C(r,t) given 
by Equation 15. 

The coefficient works out to be the following expression 
up to a positive factor: 

»+oo 

{2?7r(l — r 2 )sech 2 [arctanh(r) 
2 — ? ?){ r + tanh[arctanh(r 



• sech[arctanh(r) 




^fl]exp(- 2 " )dR. (Bl) 



First, we ignore r) in the expression kr\ because it can 
always be absorbed by k, and then we divide the above 
expression by rj. It is easy to see that the resultant ex- 
pression is a decreasing function of 77, because the follow- 
ing integral is nonnegative from symmetry analysis: 



+00 



tanh[arctanh(r) + \J 2ki]R] 



■ sech[arctanh(r) + \J 2krjR] exp(- 



R 2 



2(T - t) 



)dR. (B2) 



Therefore, we only need to prove the positivity of Ex- 
pression Bl for 77 = h. Expanding the hyperbolic func- 
tions with the addition formulas yields the following ex- 
pression for the integrand: 



2r 5 



(2r 2 - 1) (3r 2 + 1) tanh R + r (5r 2 - 3) tanh 2 R 



\/l — r 2 sechi? exp( 



2(1 + rtanhi?) 3 

R 2 , 



2{T-t) 



(B3) 
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Because an odd function of R contribute nothing to 
the integral, we only need to take into account the even 
component of the integrand, which turns out to be the 
following expression up to a positive factor: 



(-6r 7 + 16r 5 - 8r 3 ) tanh 4 R 

+ (6r 7 - 18r 5 + 8r 3 ) tanh 2 R + 2r 5 



(B4) 



From the properties of parabolic functions, it can be 
shown that the above expression, and thus the even com- 
ponent of the integrand, is nonnegative for all R. There- 
fore, Expression Bl is nonnegative. 



is a solution to the same heat equation (Clb) with the 
boundary condition (Cla) replaced by q(s*,t) = 0. We 
can express Q in terms of q as: 



1 r x 
Q(s, t) = - — I cxp( — )q(s + x, t ■ 



Bx 



)dx 



exp(--^)/(^s 



t), 



(C2) 



where f(x) is an arbitrary function of x. 

The solution to the heat equation with no boundary is 
given by: 



Appendix C: Explicit Solution of Equation (11) 

Eqs. (11a) and (lib) can be combined to give a bound- 
ary condition for p2 (r, t) . Then we would like to make 
a change of variables. Let r — tanh s (similarly for r* 
and ro), and let P2{r, t) — sechsoe~ kr,t cosh 3 sQ(s, t). Eq. 
(11) gets translated into the following equations with the 
initial condition Q(s,0) = S(s — sq). 

[-Q(s,t)-A— Q(s,t)+ B—Q{s, i)]| s=s . =0, (Cla) 

^Q( S ,t) = kr 1 ^Q(s,t),s>s*, (Clb) 

where A — csch 2 s* coths*, B — coth 2 s* /(krj). 

If Q{s, t) is a solution to the above equations, it is easy 
to see that q(s, t) = -Q(s, t) - A§ s Q{s 1 1) + Bf t Q{s, t) 



)(«,*) = 



1 



, ( s ~ s o) 2 ^ 



(C3) 



Let q (s, t) = -Q (s, t) - A§- s Q (s, t) + B§- t Q Q (s, t). 
We set q(s,t) = qo(s 7 t) — qo(2s* — s,t) so it satisfies the 
heat equation and vanishes at s* . In order for Q(s,t) in 
Eq. (C2) to meet the initial condition, we also need to 
set f(x) = Dexpi^^x), where: 



C - 
D 



A - y/A 2 + ABrjk 
2Br]k ' 
2ACexp[C(s -2s*)} 
y/A 2 + 4Brjk 



(C4a) 
(C4b) 



It can be verified that the resulting Q(s,t) given by 
Eq. (C2) is the desired solution. 
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